Forward and Backward Speech Skimming with the Elastic Audio Slider

نویسندگان

  • Wolfgang Hürst
  • Tobias Lauer
  • Cédric Bürfent
  • Georg Götz
چکیده

In pursuit of the goal to make recorded speech as easy to skim as printed text, a variety of methods and user interfaces have been suggested in the literature, involving time-compressed audio, speech segmentation and recognition, etc. We propose a new user interface, the elastic audio slider, which makes navigation in speech documents similar to video navigation or text scrolling. The approach supports navigation at variable speed in both forward and backward direction while providing immediate intelligible audio feedback during the user’s interactions. A user study was conducted to prove the usefulness of backward replay of speech for tasks such as topic classification. In addition, we show that the proposed interface offers the opportunity to combine the advantages of existing approaches within a single, easy-to-use UI component that complements and enhances the common user interfaces known from standard audio player software.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Interactive Manipulation of Replay Speed

Today’s interfaces for time-scaled audio replay have limitations especially regarding highly interactive tasks such as skimming and searching, which require quick temporary speed changes. Motivated by this shortcoming, we introduce a new interaction technique for speech skimming based on the so called rubberband metaphor. We propose an “elastic” audio slider which is especially useful for tempo...

متن کامل

Implementing the New First and Second Differentiation of a General Yield Surface in Explicit and Implicit Rate-Independent Plasticity

In the current research with novel first and second differentiations of a yield function, Euler forward along with Euler backward with its consistent elastic-plastic modulus are newly implemented in finite element program in rate-independent plasticity. An elastic-plastic internally pressurized thick walled cylinder is analyzed with four famous criteria including both pressure dependent and ind...

متن کامل

New Touch Screen Application

An adaptive speech rate control technology for ultra fast listening that is equivalent to skimming is described. Nowadays, listening to audio books on mobile devices is quite common. People read books at various levels of detail from close reading to skimming. Although a similar feature to skimming is required to efficiently obtain information from audio sources, there is no tool equivalent to ...

متن کامل

A Turbo-Decoding Weighted Forward-Backward Algorithm for Multimodal Speech Recognition

Since the performance of automatic speech recognition (ASR) still degrades under adverse acoustic conditions, recognition robustness can be improved by incorporating further modalities. The arising question of information fusion shows interesting parallels to problems in digital communications, where the turbo principle revolutionized reliable communication. In this paper, we examine whether th...

متن کامل

Turbo Decoders for Audio-Visual Continuous Speech Recognition

Visual speech, i.e., video recordings of speakers’ mouths, plays an important role in improving the robustness properties of automatic speech recognition (ASR) against noise. Optimal fusion of audio and video modalities is still one of the major challenges that attracts significant interest in the realm of audiovisual ASR. Recently, turbo decoders (TDs) have been successful in addressing the au...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2005